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We tested 12 bat species in Ghana for coronavirus 
(CoV) RNA. The virus prevalence in insectivorous bats 
(n = 123) was 9.76%. CoV was not detected in 212 fecal 
samples from Eidolon helvum fruit bats. Leaf-nosed bats 
pertaining to Hipposideros ruber by morphology had group 
1 and group 2 CoVs. Virus concentrations were <45,000 
copies/100 mg of bat feces. The diversified group 1 CoV 
shared a common ancestor with the human common cold 
virus hCoV-229E but not with hCoV-NL63, disputing hypoth- 
eses of common human descent. The most recent common 
ancestor of hCoV-229E and GhanaBt-CoVGrp1 existed 
in 1686-1800 av. The GhanaBt-CoVGrp2 shared an old 
ancestor (~2,400 years) with the severe acute respiratory 
syndrome—ike group of CoV. 


Ce (CoVs) (order Nidovirales, family Coro- 
naviridae, genus Coronavirus) are enveloped viruses 
with plus-stranded RNA genomes of 26—32 kb, the largest 
contiguous RNA genomes in nature (/). They are classi- 
fied into 3 groups, which contain viruses pathogenic for 
mammals (groups | and 2) and poultry (group 3) (/). Hu- 
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man CoVs (hCoVs)-229E, -NL63, -OC43, and -HKU1 are 
endemic worldwide and cause mainly respiratory infec- 
tions in children and adults. The severe acute respiratory 
syndrome (SARS) coronavirus (SARS-CoV) is a novel 
zoonotic coronavirus that caused an international epidemic 
in 2002-2003. Fortunately, efficient public health manage- 
ment interrupted this epidemic (2). Studies conducted in 
China in the aftermath of the SARS epidemic have identi- 
fied CoVs in bats (Chiroptera) and implicated this speciose 
mammalian order as the most likely reservoir of all known 
coronaviruses (3-7). Among the most urgent concerns 
prompted by the SARS epidemic is the likelihood of simi- 
lar future events. Thus, it seems highly relevant to study the 
ecology of bat CoVs in terms of diversity, host restriction, 
virus prevalence, risk of exposure, and the circumstances 
of past host transition events. 

The genetic diversity of bat-borne CoVs is currently 
unclear. Preliminary data suggest that CoVs may be adapt- 
ed in a stricter sense to a specific host species rather than to 
specific regions (5,6,8—/2). A variety of pathogenic CoVs 
occur in other mammals or poultry. However, the genetic 
range within these animals is considerably less than that 
observed in even single bat species or subfamilies (7, 8). 

Estimates indicate that there are >100 bat species in 
sub-Sahran Africa. This finding is in contrast to ~50 species 
in the entire Western Palaearctic region (Europe, Middle 
East, North Africa) (/3,/4). African bats have been shown 
to harbor pathogens that are occasionally transmitted to hu- 
mans. This transmission may result in severe disease out- 
breaks, e.g., Ebola and Marburg viruses (/5). Because bats 
are a part of the human diet in wide areas of Africa (/6), it 
appears highly relevant to study CoVs in African bats. 
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We have demonstrated by serologic studies that Af- 
rican bats have antibodies against CoVs (/0). Antibod- 
ies reactive with SARS-CoV antigen were detected in 47 
(6.7%) of 705 bat serum specimens from 26 species (/0). 
Recently, Tong et al. detected sequences of CoVs in bats 
from Kenya (/7). We describe the results of studies on bats 
in Ghana obtained by using noninvasive sampling of fru- 
givorous and insectivorous bats at 2 caves, a lake habitat of 
diverse insectivorous bats, and a large urban roosting site 
of frugivorous bats. Bayesian inference of diversification 
dates gave implications on the recency of the introduction 
of hCoV-229E into the human population, irrespective of 
its original source. 


Materials and Methods 


Capturing and Sampling 

In the locations identified in Figure 1, mist netting and 
sampling were conducted as described (//). In Kumasi 
Zoo, fecal samples were collected with plastic foil under 
trees occupied by Eidolon helvum bats (estimated colony 
size 300,000). For all capturing and sampling, permission 
was obtained from the Wildlife Division of the Ministry of 
Lands, Forestry, and Mines in Ghana. Research samples 
were exported under a state agreement between the Repub- 
lic of Ghana and the Federal Republic of Germany, rep- 
resented by the City of Hamburg. Additional export per- 
mission was obtained from the Veterinary Services of the 
Ghana Ministry of Food and Agriculture. 


Processing and Analysis of Samples 

Samples (14 fecal pellets or swabs suspended in 
RNA sstabilization solution [RNAlater Tissue Collec- 
tion; Applied Biosystems, Foster City, CA, USA]) were 
tested at the Kumasi Centre for Collaborative Research 
in Tropical Medicine as described (//,/8). After ini- 
tial sequencing, specific primers were designed for each 
group of CoV found. Nested reverse transcription—PCR 
(RT-PCR) primer sets used for sequencing of longer frag- 
ments of representative viruses are available upon re- 
quest. The following sequences were derived from this 
study and were submitted to GenBank under the listed 
accession numbers: RNA-dependent RNA polymerase 
(RdRp) sequences: BtCoV/Hip/GhanaBoo/348/2008, 
FJ710043; BtCoV/Hip/GhanaBoo/344/2008, FJ710044; 
BtCoV/Hip/GhanaKwam/8/2008, FJ710045; BtCoV/Hip/ 
GhanaKwam/19/2008, FJ710046; © BtCoV/Hip/Ghana 
Kwam/20/2008, FJ710047; BtCoV/Hip/GhanaKwam/ 
13/2008, FJ710048; BtCoV/Hip/GhanaKwam/3 1/2008, 
FJ710049; BtCoV/Hip/GhanaKwam/27/2008, FJ710050; 
BtCoV/Hip/GhanaK wam/26/2008, FJ710051; BtCoV/ 
Hip/GhanaKwam/24/2008, FJ710052; BtCoV/Hip/Ghana 
Kwam/10/2008, FJ710053; BtCoV/Hip/GhanaK wam/22/ 
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2008, FJ710054/nucleocapsid sequences; BtCoV/Hip.sp/ 
GhanaBoo/344/2008, FJ710055; BtCoV/Hip.sp/Ghana 
Kwam/19/2008, FJ710056. 


Phylogenetic Analysis 

Nucleic acid alignments were conducted based on 
amino acid code by using the ClustalW algorithm (www. 
ebi.ac.uk/clustalw) in the Molecular Evolutionary Genetics 
Analysis version 4.0 software package (www.megasoft- 
ware.net) (19). Two gap-free nucleotide alignments (817 
bp and 1,221 bp) were generated. Tree topologies were de- 
termined on both datasets by using MrBayes version 3.1 
(20). The analysis used a general time reversible (GTR) 
substitution model, with 6 rate categories to approximate a 
gamma-shaped rate distribution across sites and an invari- 
ant site assumption (GTR + 6 + I). Markov chain Monte 
Carlo (MCMC) chains of 107 iterations were sampled every 
500 generations, resulting in 20,000 sampled trees. Two 
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Figure 1. Location of Kwamang caves near the village of 
Kwamang, (6°58'N, 1°16"W), 50 km northeast of Kumasi, Ashanti 
region, Ghana. Booyem caves A (7°43'24.9"N, 1°59'16.5"W) 
and B (7°43'25.7"N, 1°59'33.5"W) are located near remote small 
settlements in the vicinity of Booyem, Brong-Ahafo region. Lake 
Bosumtwi is located 30 km southeast of Kumasi (6°32'22.3"N, 
1°24'41.5"W). The botanical gardens of Kwame Nkrumah National 
University of Science and Technology are located on campus in the 
city of Kumasi (6°41'6.4"N, 1°33'42.8"W). Kumasi Zoo is located in 
the center of the city (6°42'2.0"N, 1°37'29.9"W). 
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Metropolis-coupled chains (1 cold and 3 heated chains 
each) were run in parallel, compared, and pooled. Con- 
vergence of chains was confirmed by the potential scale 
reduction factor statistic in MrBayes (2/) and by visual in- 
spection of each cold chain using the TRACER program 
(22). Phylogenetic dating was conducted by using Bayes- 
ian evolutionary analysis sampling trees (BEAST) (22). 
Chain lengths in BEAST were at least 20,000,000 genera- 
tions with sampling every 500 generations. Convergence of 
the model was checked visually and by the effective sample 
size statistic with TRACER. 


Results 


Virus Detection 

During February 2008, bats were sampled in the de- 
scribed locations around Kumasi, Ghana. Initially, 7 fecal 
samples tested positive by pan-CoV PCR. Products (440 
bp, RdRp gene) were sequenced and aligned with proto- 
type CoV. Neighbor-joining phylogenies indicated 2 dis- 
tinct groups of sequences that belonged to CoV group | 
(n = 4) and group 2 (n = 3), respectively. Specific primer 
pairs for the group | and group 2 sequences were designed 
and applied again to all samples. Five additional viruses 
were found, resulting in a total CoV prevalence of 9.76% 
in insect-eating bats (n = 123). No virus was found in any 
oral swab. All virus findings in fecal samples are listed by 
capture site in Table 1. 

Notably, all CoV findings were in insect-eating leaf- 
nosed bats of the genus Hipposideros. Within the genus, 
the species H. abae could be discriminated unambigu- 
ously by morphology (Table 1). The remaining Hippo- 
sideros species were assigned to the complex of forms 
related to currently recognized species H. caffer and H. 


Table 1. Overview of bats studied, Ghana 


Bat Coronaviruses, Ghana 


ruber. Because 2 morphotypes were present (Figure 2), 
the mitochondrial cytochrome b gene was sequenced as 
described (23). Both morphotypes belonged to phyloge- 
netic lineages distinct from H. caffer and possibly repre- 
sented 2 distinct species (P. Vallo, personal ongoing in- 
vestigation). Both are collectively referred to as H. caffer 
(cf.) ruber in this study. A fraction of 15.4 % of H. cf. 
ruber specimens yielded CoV, without a difference be- 
tween sexes (14%/19%, n = 57/21 [M/F], respectively). 
Only adult males and nonlactating adult females, but no 
lactating females, juveniles, and subadults of H. cf ruber 
were encountered. 


Virus Concentrations 

To estimate the quantity of CoV genomes in bat feces, 
we did end-point dilution experiments with the nested pan- 
CoV RT-PCR (/8). The previously determined sensitivity 
limit of the PCR assay was 5—45 copies/PCR (/8). In the 
assay, the equivalent of 1 mg feces was tested per PCR tube 
(100 mg feces collected, 1:10 dilution extracted, 1:10 dilu- 
tion tested). The highest dilution factor that still yielded an 
amplification signal in any of the samples was 1:10, which 
suggested a maximal concentration of 50 to 450 CoV RNA 
copies/mg of feces. 


Virus Classification 


Group 1 CoV 

In H. cf. ruber bats in the Kwamang and Booyem 
caves, a diverse group 1 CoV was found. Further analy- 
sis was complicated by the low RNA content in samples. 
Based on alignments of prototype group | viruses, 5 differ- 
ent nested RT-PCRs were designed and the RdRp fragment 
could finally be extended by 441 bp to the 5’ end, providing 


No. positive 
Sampling site Species No. fecal samples (group 1/group 2) 

Zoo Kumasi (6°42'2.0"N, 1°37'29.9"W) Eidolon helvum 212* 0 
KNUST Botanical Garden Kumasi (6°41'6.4"N, 1°33'42.8"W)t Pipistrellus nanulus 1 0 
Glauconycteris beatrix 1 0 
Lake Bosumtwi (6°32'22.3"N, 1°24'41.5"W) Chaerephon spp. 6 0 
Nycteris hispida 1 0 
P. nanus 5 0 
P. deserti 1 0 

Cave Kwamang (6°58'N, 1°16’W) Hipposideros caffer ruber 40 10 (4/6) 
H. abae 13 0 
Cave Booyem A (7°43'24.9"N, 1°59'16.5"W) H. cf. ruber 8 0 
Coleura afra 12 0 

Cave Booyem B (7°43'25.7"N, 1°59'33.5"W) H. cf. ruber$ 11 2 (1/1) 
H. abae 3 0 
Coleura afra 21 0 
Total 335 12 


*These samples were collected without individual association to bats. Due to a low sampling frequency, it can be assumed that each sample was from an 


individual bat. 
t+tKwame Nkrumah University of Science and Technology. 
+Two morphotypes were observed. 
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Figure 2. Two morphotypes of Hipposideros caffer ruber, held by 
one of the authors (F.G.-R.), who was wearing a leather glove. 
Photograph courtesy of Antje Seebens. 


an 817-nt fragment for phylogenetic analysis. All methods 
of phylogenetic inference placed this virus next to a com- 
mon ancestor with human coronavirus 229E, which circu- 
lates worldwide in humans (Figure 3). Bootstrap support of 
the hCoV-229E/GhanaBt-CoVGrpl root point in neighbor- 
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joining analysis was 100%. The corresponding Bayesian 
posterior probability was 1.0. The most closely related 
member of the GhanaBt-CoVGrp1 clade shared 91.90% 
nucleotide identity with hCoV-229E in the analyzed frag- 
ment. The most distant member was 86.50% identical. 
The next phylogenetic neighbor, the human CoV hCoV- 
NL63, was only 74.70%—78.60% identical in the analyzed 
fragment. 


Group 2 CoV 

With the pan-CoV screening assay, a group 2 CoV 
was initially found in the Kwamang cave. Sequences from 
3 bats were identical. The secondary group-specific PCR 
identified 4 additional samples of this virus, | of them from 
Booyem Cave B and the remaining from Kwamang. Nu- 
cleotide identity among these sequences was 97.2%—100%. 
Phylogenetic analysis with different methods of inference 
(neighbor-joining nucleotide-based, neighbor-joining ami- 
no acid-based, Bayesian) yielded variable tree topologies 
suggesting basal associations with either the 2a, 2d, or 2b 
subgroups (data not shown) (24). Based on alignments of 
prototype group II viruses, 8 additional nested RT-PCR 
primer sets were designed and 2 of the samples could be 


SARS-CoOvP rab armen 2003 
AID ViDemscueterCKI TAY 
TurtaryCOIATOOYR-91 1 


Figure 3. A) Phylogeny of coronaviruses (CoVs) in the RNA- 
dependent RNA polymerase gene (RdRp, 817-bp fragment) with 
root point dates derived from Bayesian inference under a relaxed 
lognormal molecular clock assumption with a codon-based 
substitution model (SRDO6) and an assumption of expansion 
growth of the virus population. Estimated dates of diversification of 
CoV lineages at root points are shown in italics for the expansion 
growth population model and in regular type for the exponential 
growth model. Dates sc are identified with a suffix; dates ap are 
not. B) Bayesian phylogeny from the CoV group 1 root, using the 
nucleocapsid (N) gene. Highest posterior densities for all root points 
were >0.99, except where indicated. 
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amplified. Sequences could be extended 520 bp upstream 
and 383 bp downstream of the initial fragments, yielding 
1,221-bp fragments for phylogenetic analysis. Bayesian 
phylogenetic inference with different substitution models 
and parallel analysis using Metropolis coupling now placed 
the virus reliably next to a common ancestor with the 2b 
group of CoV (SARS-like viruses, Figure 3). The Bayes- 
ian posterior probability of the CoV 2b/GhanaBt-CoVGrp2 
clade being monoyphletic was 1.0. A maximum of 72.2% 
nucleotide identity was shared with SARS CoV. 


Molecular Dating 

Reliable isolation dates were researched in the litera- 
ture for each employed virus. Because a reliable molecular 
clock dating existed for the most recent common ances- 
tor (MRCA) of the hCoV-OC43/bovine CoV pair (25), 
this date was set as a normal-distributed probabilistic prior 
within the published ranges (25) for calibration of all analy- 
ses. A first analysis was conducted on the 1,225-bp dataset 
that did not include the novel GhanaBt-CoVGrp1. All virus 
sequences were assumed to be contemporary. Phylogeny 
was inferred using a GTR + I'4 + I model. The resulting 
MRCA date of the CoV2b (SARS-like)/GhanaBt-CoVGrp2 
clade was 260 ap and that of the hCoV-NL63/-229E pair 
was 981 Ap (see Table 2 for details). To include the novel 
GhanaBat-CoVGrp1, we repeated the same analysis by us- 
ing the 817-bp dataset. The resulting MRCA date of the 
hCoV-NL63/229E pair was 816 Ap in this analysis, which 
was in good concordance with results from the 1,221-bp 
dataset (Table 2) and also with previously published data 
(26). The diversification estimate for the novel group | bat- 
CoV and hCoV-229E then was 1803 ap. 


Bat Coronaviruses, Ghana 


Because it has been suggested that codon-based evolu- 
tionary models may be preferred for Bayesian phylogenetic 
inference from protein-coding datasets (27), analyses on the 
817-bp dataset were repeated by using the SRD06 substitu- 
tion model in BEAST. This analysis did not yield a differ- 
ent substitution rate, but resulted in older resulting MRCA 
dates (Table 2). A Bayes factor test conducted in TRACER 
yielded a strong estimate of superiority of the codon-based 
model over the GTR + 4 + I model (log,, Bayes factor 139 
[20 is highly significant]). To further optimize the predic- 
tion of MRCA dates, the constant population size assump- 
tion used in all analyses was exchanged against expansion 
growth or exponential growth assumptions. Both assump- 
tions were predicted to fit the data better than the constant 
size model (Bayes factors 13.5 and 13.9). There was no 
difference between the expansion and exponential mod- 
els (Bayes factor 0.34 in favor of expansion). The MRCA 
date of hCoV-229E and the GhanaBt-CoVGrp1 was 1686 
(expansion) or 1800 (exponential growth). Table 2 sum- 
marizes the results. Figure 3 shows a dated phylogeny of 
coronaviruses with MRCAs according to the 2 last men- 
tioned analyses. 


Recombination 

To determine whether CoV recombination might play 
a role in the studied virus population, the structural nucleo- 
capsid gene was amplified using 8 nested RT-PCR primer 
sets that had been designed on alignments of all available 
CoV group | nucleocapsid sequences. Using a similar ap- 
proach, we also tested the same samples for CoV group 2 
nucleocapsid sequences. Only group 1 RT-PCRs yielded 
fragments. These fragments could be combined into contig- 


Table 2. Results of molecular clock analyses, study of coronaviruses in bats, Ghana* 


Mean MRCA (95% Cl, HPD)+ 
substitution rate SARS-like/ HCoV-229E/ 
Alignment, = (substitutions/ Population model, Root point GhanaBt- hCoV-229E/ GhanaBt- 
bp site/year) substitution model (range)t SARS-like§ CoVGrp2 hCoV-NL63 CoVGrp1 
1,221 2.1« 10" Constant size, 2243 1905 AD 260 981 - 
(1.2-3.1 x 10) nucleotide (4521-290) (1867-1941) (792 Bc-1178) (161 BC—1324) 
(GTR+G +1) 
817 2.1 x 10" Constant size, 2053 1852 AD 348 816 1803 
(1.5-2.7 x 10“) nucleotide (3433-731) (1612-1852) (119 BC-820) (320-1290) (1694-1906) 
(GTR+G +l) 
817 1.6 x10~ Constant size, 4500 1674 768 BC 168 1659 
(0.8-2.5 x 10“) codon-based (7305-1918) (1516-1804) (2037 Bc-360) (1111 BC—721) (1503-1804) 
(SRDO6) 
817 15x 104 Expansion growth, 5024 1628 384 BC 20 1686 
(0.9-2.2 x 10“) codon-based (9261-1360) (1379-1836) (2060 BC-1074) (1347 Bc-1174) (1460-1871) 
(SRDO6) 
817 1.8x 104 Exponential growth, 4765 1667 425 460 1800 


codon-based 
(SRDO6) 


(0.9-2.8 x 107) 


(7999-1707) (1436-1853) 


(1544 Bc-1193) (956 BC-1271) (1501-1883) 


*MRCA, most recent common ancestor; Cl, confidence interval; HPD, high population density; SARS, severe acute respiratory syndrome; hCoV, human 
coronavirus; GTR +I + |, general time reversible gamma-shaped rate distribution across sites and an invariant site assumption. 


tEstimation of the year (BC) of the most recent common ancestor. 


Estimation of the year of the most recent common ancestor of extant CoV. All years AD except as indicated. 


§CoV group 2b without novel Bt-CoV from this study (Figure 2). 
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uous 1,030-nt sequences for Bt-CoV GhanaKwam 19 and 
1,176 nt for Bt-CoV GhanaBoo 344. As shown in Figure 3, 
panel B, the resulting phylogenetic placement was exactly 
matching that of the RdRp fragments, giving no evidence of 
recombination between the RdRp region located in the mid- 
dle of the genome and the nucleocapsid gene located at the 
extreme downstream end. Sequencing of the nucleocapsid 
gene of the GhanaBt-CoVGrp2 was not successful when 
we used 15 nested RT-PCRs designed on alignments of all 
available CoV 2b nucleocapsid sequences. Amplification 
with above mentioned nested RT-PCRs for CoV group 1 
was also unsuccessful. 


Discussion 

In the aftermath of the SARS epidemic, bats have been 
identified as carriers of CoV in China (3—7). Furthermore, in 
addition to our earlier finding of antibodies against CoVs in 
various African bats (/0), we have confirmed the presence 
of CoV in bats of Ghana. Together with recent data from 
Germany, North America, Trinidad, and Kenya (//,/2,28), 
these findings suggest that the association of CoV with bats 
is a worldwide phenomenon. The prevalence of CoV in 
insect-eating bats (9.76%) matched our previous findings 
in Germany. However, in that study we sampled during 
the breeding season and showed that CoVs are most likely 
amplified in maternity roosts (//). The composition of the 
catch in this study (no lactating females, no young bats) 
suggests sampling outside the breeding season and may 
not be directly comparable. Future studies relating to risks 
of exposure should address whether virus prevalence may 
change over time. 

The risk of exposure was also addressed by investi- 
gations of virus concentration. Several groups have shown 
that CoVs are almost exclusively detected in bat feces and 
not, as hypothesized earlier, in saliva (3,4,28,29). Surpris- 
ingly, little virus was found in all fecal samples tested in 
our study. We estimated the RNA concentration per full 
sample (100 mg feces = 2-4 fecal pellets) to be only up to 
4.5 x 104 RNA copies. Human pathogenic viruses trans- 
mitted by the fecal-oral route generate much higher virus 
concentrations in stool, up to ~10’? RNA copies/mg, e.g., 
for different picornaviruses (30-32). Based on these data it 
would be difficult to postulate that humans can acquire CoV 
from bat feces. However, studies in other locations and at 
different times are needed to address virus concentration 
in bat droppings in more detail. Because virus in this study 
was only observed in insectivorous bats and not in frugivo- 
rous bats, future studies should investigate whether insects 
might constitute a source of CoV infection for bats. 

To achieve a direct prediction of the potential of bat 
CoVs to infect human cells, it would be highly relevant 
to conduct virus isolation studies on bat feces. However, 
in our study we sampled no more than 100 mg of feces 
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per bat. All samples had to be collected in RNAlater so- 
lution (0.5 mL) (Applied Biosystems) for reasons of stor- 
age and transportation. Although it has been suggested that 
RNAlater solution may preserve virus infectivity (33,34), 
our observations showed that the solution has to be diluted 
at least 1:20 in cell culture medium to avoid cytotoxicity 
(data not shown). Because of the low virus RNA concen- 
trations observed, we did not attempt to isolate the virus. 
However, the absence of successful virus isolation from bat 
feces in previous studies (3—6,8,//,/2) may not reflect the 
incapability of bat CoV to infect human cells. Recently, a 
synthetic bat CoV complemented with an appropriate spike 
protein has shown potential to infect human cells (35). 

Reconstruction of phylogenetic and temporal relation- 
ships between bat CoV and other mammalian CoV is an- 
other way to obtain information on their zoonotic potential. 
Unfortunately, for CoV long sequence fragments must be 
analyzed before valid phylogenies can be inferred from the 
conserved nonstructural genome portion (28,36). Because 
of the low concentration of RNA in bat samples, generation 
of long sequences from novel bat CoV is tedious and techni- 
cally demanding, which may be why some published phy- 
logenies of bat CoV are based on short datasets, making it 
difficult to use these data for reference. For molecular clock 
dating, we have therefore relied on reference viruses main- 
ly from other mammals that covered our 1,221-bp fragment 
in the conserved RdRp region. We assumed that the RdRp 
would be under less selective pressure than the structural 
genes and other nonstructural genes, and therefore could 
be used to infer nucleotide substitution rates over distant- 
ly related CoVs (7,25,26,36-38). We have confirmed all 
tree topologies using alternative methods of phylogenetic 
inference, including an MCMC algorithm implemented in 
MrBayes that eliminates artifacts contributed by fixation of 
MCMC chains in suboptimal prosterior probability max- 
ima (20). Calibration was conducted on reliable isolation 
dates of prototype and novel bat CoV from the literature, 
as well as on the MRCA of the hCoV-OC43/Bovine CoV 
clade. For dating of only this specific CoV clade, a wide 
range of dated virus isolates has been available that covered 
as much as 34% (1965-2004) of the projected time of virus 
evolution from root to tip (1890-2004) (25). A probabilis- 
tic calibration prior was used, which is favorable for dating 
in combination with relaxed molecular clock assumptions 
(39). The determined mean substitution rates were in good 
concordance with earlier studies on non-bat—CoV that used 
maximum likelihood-based methods in addition to Bayes- 
ian inference (25,26,38,40). 

Although the exponential growth prior on the virus 
population seemed equivalent with an expansion growth 
model by the Bayes factor test and produced highly com- 
patible MRCAs, the exponential model produced a bet- 
ter match with the previously determined MRCA of the 
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HCoV-NL63/HCoV-229E pair (26). Because Pyrc et al. 
generated these data by 3 alternative approaches (Bayes- 
ian, serial unweighted pair group method with arithmetic 
mean, maximum likelihood [26]), we used their MRCA to 
validate our results, and consequently prefer the MRCA 
dating from the exponential growth population model (as 
presented in Figure 3 in plain type). One earlier study on 
bat and non-bat CoV suggested a much faster evolutionary 
rate for CoV than other studies (7). As Vijaykrishna et al. 
pointed out, their results were associated with large confi- 
dence intervals caused by the lack of available data on Bt 
CoV at the time the study was conducted (7). The increase 
of available sequence data now enables a better account of 
CoV evolutionary history. 

All CoVs in our study were found in members of the 
genus Hipposideros (family Hipposideridae). The genus 
Rhinolophus from the sister family Rhinolophidae was 
found to host SARS-like viruses in several studies in China. 
One of our Hipposideros CoVs was in a basal phylogenetic 
relationship with the SARS-like clade (group 2b); their 
most recent common ancestors date back to ~400 Bc. Tong 
et al. (/7) have detected a sequence fragment of a bat CoV 
in Kenya that also belongs to the 2b clade but is associated 
with the genus Chaerephon, a free-tailed bat that is rather 
distantly related to the genus Hipposideros. Although these 
authors analyzed only a short sequence fragment, their 2b 
CoV seems to be related more closely to SARS CoV than 
the virus found in our study. In the many studies conducted 
in China, only closely related members of the 2b group 
were detected, with the most basal members dating back 
only to the 17th century, according to our analysis. The co- 
occurrence of basal and closely related viruses in Africa, as 
well as the existence of the same virus clade in bats other 
than those of the family Hipposideridae, may entail specu- 
lations about a possible origin of the SARS-like group of 
CoVs in Africa rather than in Asia. 

Another result that should be integrated with earlier 
findings is the surprisingly recent date of the MRCA of 
the novel Grp! Bt CoV and the human common cold virus 
hCoV-229E. Further to the proven recent host switching of 
SARS CoV, Vijgen et al. have suggested that hCoV-OC43 
entered the human population ~120 years ago, causing a 
pandemic (25). This virus was most likely acquired by hu- 
mans from domestic cattle. Results of our study show that 
it is not unlikely that hCoV-229E, which today is circulat- 
ing worldwide in humans, resulted from a host switching 
event not more than 208-322 years ago. However, as with 
molecular clock dating of viruses, associated confidence 
limits should not be overlooked. 

Because H. cf. ruber bats are found only in sub-Saha- 
ran Africa and are not migratory (23), it would be relevant 
to know how tightly the associated CoV is restricted to its 
host. Despite the statistical limitations of our rather small 
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sample size, the absence of CoV in bats of the closely relat- 
ed species H. abae that were tested in our study in 2 differ- 
ent caves speaks in favor of tight host restriction. Another 
supportive argument is the absence of CoV in C. afra, a bat 
species sampled in sufficient numbers at the Booyem cave. 
This cave was coinhabited by CoV-positive H. cf. ruber 
bats. If tight host restriction to nonmigratory H. cf ruber 
bats existed, this would indicate an origin of hCoV-229E 
within the geographic range of its host, 1.e., the rainforest 
belt and the wet forested savannahs of sub-Saharan Af- 
rica (23). Unfortunately, it will be difficult to reconstruct 
whether the projected host transition event might have been 
associated with human epidemic disease. 
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